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The rest frame subjet algorithm is introduced to define the subjets for the SISCone jet; with 
this algorithm, an infrared and collinear safe jet shape observable N-subjettiness, t 3 n , is defined to 
discriminate the fat jet, from a highly boosted color singlet particle decaying to N partons, from the 
QCD jet. Using rest frame subjets and r| on dijets from highly boosted H/W/Z bosons through 
pp — > HW , HZ with run = 120GeV, we found that statistical significance of the signal, from the 
fully hadronic channels, is about 2a for 14TeV collisions with C ~ 30fb -1 . 



Current electroweak precision fits of the standard 
model(SM) prefer a light Higgs boson[T]; experimental 
bounds from LEPpQ and Tevatron[2] suggest the SM 
Higgs mass, mji, is about 120GeV. However, the SM 
Higgs boson mg < 135GeV has been considered hard 
to discover because it dominantly decays to a b-quark 
pair. Unlike the signals from leptonic decay modes, 
those from hadronic decay channels undergo overwhelm- 
ing QCD background processes. Finding the Higgs with 
m H ^5 135GeV requires combining many possible decay 
channels such as H — > 77, WW, bb and tt. 

Recently, there has been a study on the jet substruc- 
ture via pp — > WE, ZH where H is highly boosted to 
form a single fat jet, and W/Z decays leptonically[3]. 
According to ATLAS detector simulation [1 , it is ex- 
pected to also be a promising channel for the Higgs 
search. Other subjet techniques are also proposed for 
reconstructing the mass peak of heavy particles [1HZ] 
and reducing the QCD contamination effects[5]. Jet 
radius also plays an important role for the mass peak 
reconstruction [3], and several schemes for optimizing jet 
radius to improve the invariant mass distribution are pro- 
posed PZI1EEI]. 

Beside jet substructure, ref. [12] suggested 'template 
overlap' method to match jet energy flow with the par- 
tons directly. The property of color structure of the Higgs 
decay has also been studied for searching new particle via 
the double diffractive process [13] and the Higgs decaying 
to the two b-tagged jets (13) . 

To search heavy particles, however, those techniques 
usually require signal processes to involve additional un- 
usual signatures, such as lepton or missing-p^, to sup- 
press QCD background; fully hadronic decay channels, 
such as Higgs production associated with W/Z boson, 
pp — > HW, HZ, are still considered too hard to be used 
for the heavy particle study. Usually, however, cross- 
sections of hadronic decay channels are, at least, few 
times larger than (semi-)leptonic decay channels; utiliz- 
ing them will improve, for instance, the Higgs discovery 
potential. 

In this paper, to identify boosted color singlet parti- 
cles, we introduce the subjet definition for SISCone jet 
[15] and an infrared and collinear safe jet shape observ- 
able, t j n . With 7-3, as an illustration, we investigate the 



statistical significance of the signals from fully hadronic 
decay channels of light SM Higgs. 

SISCone jet algorithm is one of the best algorithms 
for reconstructing mass peaks, i.e. it has low sensitivity 
to underlying event(UE) contamination [9] I16j. In many 
cases including light SM Higgs searches, however, such 
mass peaks are ruined by huge QCD background con- 
tributions. To suppress the background, the underlying 
structure of the jet is essential, but SISCone jet algo- 
rithm misses the informations. Moreover, the jet sub- 
structure is also required for the boosted Higgs jet tag- 
ging. The boosted Higgs jets contain not a single, but two 
secondary vertices originating from two b-quarks. The 
identification of the two b-quarks is crucial for separating 
the signal from the large background; since gluon split- 
ting into b-quarks is major source of QCD background, 
the directions of the subjets can be used to reduce the 
background [I]. Previously, such two b subjets tag- 
ging algorithms are devised for sequential recombination 
jet algorithms; for example, Cambridge/Aachen(C/A) 
algorithm|17j. With a subjet definition for SISCone jet, 
however, those two b subjets tagging algorithms can also 
be employed with SISCone jet, too. These reasons have 
motivated us to devise a subjet algorithm using the SIS- 
Cone jet algorithm for finding a boosted particle while 
reducing QCD background. 

We considered several possible ways to define the sub- 
jet of the SISCone jet. One is, for a given jet, clustering 
its constituents with a sequential recombination jet algo- 
rithm such as C/A with mass drop and filtering(MD-F) 
techniques [3J. However, sequential recombination jet al- 
gorithms require a larger initial jet radius than SISCone 
jet algorithm to catch perturbative radiation of the par- 
ticle; applying C/A MD-F to the SISCone jet is less effi- 
cient than using C/A MD-F solely. Other one is cluster- 
ing the constituents of the jet using SISCone algorithm 
with a smaller jet radius, i.e. applying the jet trimming 
algorithmjS]. As the Higgs is boosted highly, however, 
the cones which define subjets are likely to overlap. Al- 
though SISCone algorithm merges the overlapping cones 
or splits them according to the overlap threshold param- 
eter, we found that this subjet definition is inefficient, 
too. Last method we considered is to define the subjet 
in the 'rest frame' of the jet. 
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FIG. 1: Illustration of the jet and the rest frame subjets. a) 
The lab frame, b) The jet rest frame. 



The fat jet from a boosted color singlet particle is, a 
good approximation, a closed system, i.e. it does not 
interact with the rest of the system when it hadronizes. 
Thus, the rest frame of the particle is identical to the CM 
frame of its decay products. In contrast, a jet from a col- 
ored particle is not a closed system. Although colored 
partons are almost free at high energy due to asymptotic 
freedom, they must hadronize and the hadronization pro- 
cess involves exchanging the four momentum. Because 
the four momentum of the jet is different from that of 
the hard parton, the rest frame of a colored particle has 
ambiguities. 

We define the jet rest frame as a frame where the four 
momentum of the jet equals p r ^ st = (m^,, 0,0,0). 1 A 
jet is consisted of its constituent particles. The distribu- 
tion of constituent particles of a fat jet, in their center of 
mass frame, is nearly identical to those of the color sin- 
glet particle which is produced at rest. Thus, we recluster 
them in the jet rest frame, i.e. using their four momenta 
at the rest frame of the jet. For a given jet, the 'rest 
frame subjets' are defined as these reclustered jets. This 
procedure is illustrated in Fig. [I] 

UE affects the jet mass and, thus, the jet rest frame. 
Consider a fat jet of p^ = 7771(1, /3), defined by a cone 
with angle, 9 C . In the jet rest frame, the energy of mass- 
less UE particles can be, at most, 7(1 — f3 cos 9 c ) times 
larger than its energy in the lab frame. For UE parti- 
cles with E < lGeV, inside the SISCone jet with a jet 
radius 0.8 and 7 ~ 10, the factor is about 3; soft mass- 
less objects in the lab frame remain soft in the jet rest 
frame. Moreover, they are collimated to the boost axis, 
cos# ~ 0.97, and unlikely to included the leading rest 
frame subjets; to be identified as a single fat jet, the an- 
gles between the axis and the leading rest frame subjets 



are usually more large. Thus, the rest frame subjet al- 
gorithm is infrared and collinear safe if an infrared and 
collinear safe jet algorithm is used for the rest frame sub- 
jet clustering, and the leading rest frame subjets of fat 
jets do not much affected by UE particles. 

In the jet rest frame, we can treat the constituent par- 
ticles of the jet as the final state particles of a fictitious 
event, because it is the center of mass frame for the par- 
ticles. For example, in the rest frame of the Higgs jet, 
the constituent particles look like a dijet event from the 
Higgs produced at rest. To have such a shape, the QCD 
jet should radiate only one hard parton which is improb- 
able. Since the gluon and quark jets are not closed sys- 
tems, their shape in the rest frame does not correspond 
to any physical state, and is more likely to be irregular. 
Thus, by checking whether a shape of a jet in the rest 
frame looks like 'N-jet' event, i.e. by analyzing whether 
the jet has N 'rest frame subjets', we can discriminate be- 
tween the fat jet, from the boosted color singlet particle 
decaying to N partons, and the QCD jet. 

To select jets which have N rest frame subjets, we 
employ the N-jettiness [15]. A global event shape "N- 
jettiness", tn, is devised to filter out events which have 
additional undesired jets beyond the required N jets. 
Treating the constituent particles of the jet as final state 
particles of a fictitious event, we can apply rjy to the jet, 
i.e. calculate "N-subjettiness" , to determine whether the 
jet has N rest frame subjets. tm vanishes in the limit of 
exactly N infinitely narrow jets; we expect the fat jets 
tend to result smaller than that of the QCD jet. 

Except for it is defined by constituent particles of a jet 
instead whole final state particles of a event, the defini- 
tion of t 3 n is identical to that of rj\r. In the jet rest frame, 
we tag N most energetic subjets, and define t ] n as : 

T N = jet , 2 X! min lgl -Pk,Q2 -p k ,---,qN -Pk} (1) 

\ m inv) fc e j 

where pk is four momenta of the constituent particles of 
the jet, J, and qi is four momenta of the N energetic 
subjets. Definitions of tjv and the rest frame subjet are 
infrared and collinear safe; and, thus, t j n is also an in- 
frared and collinear safe observable. 

t 3 n is calculated through following steps. Given a hard 
jet J with p J and its constituent particles {i\i € J} : 

1. Define the boost vector, (3j to transform p^ to 
P J / est = (rr4nv, 0,0,0). 

2. Boost p 1 ^ by f3j, and obtain p l l f est . 

3. Clustering subjet with p i l f est and R su bj e t- 

4. Sort jets into decreasing £T est order and label N 
most energetic subjets. 



1 We found that the similar concept for the top tagging exists in 
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Get four momenta of the N subjets in the lab frame, 
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FIG. 2: t 3 2 distributions of jets with HOGeV < m^* < 
130GeV, p T > 200GeV, \y\ < 2.5. 

6 - T N = {m £t )2 Efeej min{qi • p fc , . . . , q N ■ p k }. 

r| distributions of jets with HOGeV < mf^L < 130GeV, 
p T > 200GeV, \y\ < 2.5 are shown at Fig. |j 

Applying t j n with N = 2, we investigate discovery po- 
tential of the SM Higgs with ran — 120GeV, through 
the fully hadronic pp — > WH, ZH channels where both 
W/Z and H are highly boosted enough to be identified 
as single fat jets. The PYTHIA 6.4.23 [20] with the AT- 
LAS MC09 parameter tune [2T] and the modified leading- 
order MRST2007 parton distribution functions [22] are 
used to generate both the signal samples, pp — > HV, and 
the background samples, pp — > VV, Vj, tt, tV , tj and 
jj events at 14TeV. No K-factors are applied to emulate 
higher order effects. 

FastJet 2.4.2 [23] with the SISCone plugin [15] is used 
for the jet clustering, and the SISCone in spherical coor- 
dinates is used for the rest frame subjet clustering. SIS- 
Cone jet algorithm has two parameters; the jet radius, 
R, and the overlap threshold, /. For the fat jet tagging 
using , the jet radius should be set to maximize the 
mass peak of the signal, and the optimal subjet radius 
should be set to maximize the discrimination power of . 
With rriH = 120GeV, we use R = 0.8, overlap threshold 
/ = 0.75 for jet clustering, and R = 0.6, / = 0.75 for 
subjet clustering. 

The b-tagging, c-jet misidentification and light-jet 
misidentification probabilities, e& , e c and e m is, are crucial 
factors to the analysis of this letter. According to Ref. 
[4], ef, ~ 70% (corresponding to s» 50% signal efficiency) 
with t rn is ~ 1%> and e c ~ 10% is expected to be achieved. 
Although these values are estimated using C/A MD-F 
algorithm, SISCone jet with the rest frame subjet algo- 
rithm can also provide required information; and, thus, 
compatible with it. Therefore, we employ these values 
for the analysis in this paper. For estimating the effects 
on the double b-tag performance of the rest frame subjet, 
separate event samples are generated with B-mesons are 
set to be stable; then, b-tag is assigned to the each subjet 
with probability tb for b-subjet, e c for c-subjet, and e m , s 



for QCD jet. 

To select the signal events which contain two fat jets 
from the boosted H and V of fully hadronic pp — > 
WH, ZH channels while reducing QCD backgrounds, we 
require : 

1. No hard leptons with px > lOGeV, except those 
from B-mesons. 

2. Two hardest jets j\ and j'2 with Ptj 12 > 200GeV 

3. \yj 1 2 | < 2.5 and \yj 1 — yj 2 \ < 2.0 where is ra- 
pidity of jj. 

4. Require third hardest jet, ^'3, to be not hard : 
p T ,j 3 < 30GeV 

5. 6-cut : both of the two energetic rest frame subjets 
of the Higgs candidate jet are b-tagged. 

6. -] nn : r{ n , 7^ < 0.08 

7. cos6> s -cut : in the jet rest frame, angles, 6 S , between 
two leading rest frame subjets and the boost axis 
should be large enough to be cos6> s < 0.8. 

To check whether leptons are comes from semi-leptonic 
decays of _B-meson, we use Pythia's internal data; we 
have checked that it gives good approximation for the 
lepton isolation for the purpose of this paper. 

cos^-cut plays similar roles of symmetry cut, y cu t, in- 
troduced in Ref. [3]. To be reconstructed properly, two 
b-quarks of the Higgs jet should be energetic; and it re- 
sults smaller cos 6 S . cos # s -cut also reduces the effects of 
UE. As mentioned previously, UE particles are concen- 
trated along the boost axis in the jet rest frame. Since 
they are mixed with soft particles from the Higgs jet, 
however, removing these particle is difficult; and remov- 
ing them affects the jet mass. By requiring cos^-cut, 
we can select the fat jets which leading two rest frame 
subjets are not much contaminated by UE, while pre- 
serving the jet mass and reducing QCD backgrounds; for 
SISCone jets, we found that it is a more efficient then 
removing the soft particles near the boost axis. 

We define signal and background events as which one of 
ji ■ 32 is the Higgs candidate jet, i. e. has in the range 
between 120 ± lOGeV and passes 6-cut, and other jet has 
mjnv in the range between [raw ± 10 or mz ± 10)GeV. 
The result before the 6-cut step of this scheme is shown 
at Table [I] The cross section for signal and background 
events after applying 6-cut, Tj-cut or both cuts are shown 
at Table [TT1 Finally, about 36% of QCD dijet backgrounds 
and less than 5% of the Higgs and other background pro- 
cesses are filtered out by cos# s -cut. With C ~ 30fb _1 , 
expected m jet distributions are shown at Fig. [3j note 
that the mass peaks at mw, mz should be clearer than 
the Higgs signals and we expect they can be used for 
the calibrations. The signal / background ratio, S / B, 
is about 30 / 200; and, thus, the statistical significance 
of the signal is about 2a. With a more conservative fa- 
tagging efficiency of 60% and a light-quark jet fake rate 
of 2%, it is decreased to about 1.5er. 
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FIG. 3: Expected m-jet distributions of the signal and back- 
ground events that passes all cuts with £. ~ SOfb^ 1 . Pythia 
with the ATLAS MC09 parameter tune and SISCone jet with 
R — 0.8 is used. The J J sample includes 66. The b-tag ef- 
ficiency, c-jet misidentification and light-jet misidentification 
probabilities are assumed to be 70%, 10%, 1%. 
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TABLE I: Cross section for the signal and background events 
which one of two hardest jet has ra^J, in the range between 
120 ± lOGeV and other jet has va 3 ^ v in the range between 
(raw ± 10 or ra z ± 10)GeV. SISCone jet with R = 0.8 is 
used. 



The uncertainties of the statistical significance of sig- 
nals are largely come from mass resolution of jet masses, 
and modeling of gluon splitting into bb since most back- 
grounds come from light quark jets and gluon jets with 
the gluon splitting. Both of r^-cut and b-tagging use the 



leading subjets' informations and, thus, there is a loose 
correlation between them; a jet of lower t 3 2 is more likely 
to pass b-tag cut. About 2% of light quark jets and 6% of 
gluon jets which have t 3 2 < 0.08 and to^L of 120±10GeV 
is expected to pass 6-cut. 

In conclusion, for the boosted color singlet particle 
searches, we have introduced the jet rest frame, the rest 
frame subjet, and the N-subjettiness. Using SISCone jet 
with r|, we see that, for the fully hadronic pp — > HV 
channels of the SM Higgs with m# = 120GeV, the sta- 
tistical significance of the signals is about 2a for 14TeV 
collisions with L ~ 30fb _1 . Although it will be comple- 
mentary to the known Higgs search channels, the scheme 
suggested in this letter is rather a proof of concept; the 
sch eme will be improved further to increase the sig nal 
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TABLE II: Cross section for for the signal and background 
events with a mass window of 20GeV centered on the mass 
peak after applying 6-cut, r|-cut or both cuts. 



to background ratio, and to make full use of the jet rest 
frame. It involves comprehensive studies on theoretical 
uncertainties of the scheme, and we left them for the fu- 
ture study. The rest frame subjet can be defined by any 
jet algorithms, although the effects of UE, and pileup on 
the scheme depend on the jet algorithm. We also ex- 
pect the scheme can also be employed for highly boosted 
colored particles. 
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